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ABSTRACT 

In this paper we present an algorithm for optimal processing 
of time-dependent sequenced route queries in road networks, 
i.e., given a road network where the travel time over an edge 
is time-dependent and a given ordered list of categories of 
interest, we find the fastest route between an origin and des¬ 
tination that passes through a sequence of points of interest 
belonging to each of the specified categories of interest. For 
instance, considering a city road network at a given depar¬ 
ture time, one can find the fastest route between one’s work 
and his/her home, passing through a bank, a supermarket 
and a restaurant, in this order. The main contribution of 
our work is the consideration of the time-dependency of the 
network, a realistic characteristic of urban road networks, 
which has not been considered previously when addressing 
the optimal sequenced route query. Our approach uses the 
A* search paradigm that is equipped with an admissible 
heuristic function, thus guaranteed to yield the optimal so¬ 
lution, along with a pruning scheme for further reducing the 
search space. In order to compare our proposal we extended 
a previously proposed solution aimed at non-time dependent 
sequenced route queries, enabling it to deal with the time- 
dependency. Our experiments using real and synthetic data 
sets have shown our proposed solution to be up to two orders 
of magnitude faster than the temporally extended previous 
solution. 

1. INTRODUCTION 

The optimal sequenced route (OSR) query was originally 
introduced in [^. It aims at finding the optimal route from 
an origin location passing through a number of points of 
interest (POIs), each belonging to a specific sequence of 
categories of interest (COIs). This query has several ap¬ 
plications within location-based services or car navigation 
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systems, e.g., planning a trip where one leaves from work 
towards a bank to withdraw money and then to a shop¬ 
ping mall. Note that “banks” and “shopping malls” form two 
COIs, each with many possible POIs. More importantly, al¬ 
though there may be many banks and shopping malls in the 
city, an OSR query chooses the one bank and the one shop¬ 
ping mall that, in this order, minimize the total cost, such 
as the travel time, of the trip. Some variations and solutions 
to the OSR query have been proposed in and El (cf. 
Section . 

However, the “standard” OSR query does not take into 
consideration a single destination that needs to be reached, 
e.g., after satisfying the COI traversal criterion one may need 
to go home. More importantly, on a typical road network 
the time a user requires to traverse an edge depends on the 
departure time, i.e., the fact that the road network is a time- 
dependent graph. We, on the other hand, take this char¬ 
acteristic as an intrinsic part of the problem and propose, 
as our main contribution, an efficient algorithm to solve the 
new optimal time-dependent sequenee route (OTDSR) query, 
defined as follows: 

The OTDSR(TDG, s,d,t,Qc) query takes as in¬ 
put a time-dependent graph TDG and returns a 
route for a user departing at time t from s to¬ 
wards d, visiting exaetly one P01 from eaeh COI 
in the ordered list Qc, sueh that this route is op¬ 
timal in terms of total travel time. 

Since time is essential in this case, we also consider the 
time the user will spend at each POI (such assumption is 
not considered nor necessary in the standard OSR query). 
The case where that such time is itself time-dependent (e.g., 
being served at a restaurant may take longer at noon than 
at 11am or 1pm regardless of the user) can be trivially ad¬ 
dressed by simply substituting a POI node with two nodes, 
an “entry” and an “exit” node, modelling the time spent at 
the POI as the time cost as the time to travel between 
such nodes. That is, the problem remains essentially the 
same and our proposed solution is readily applicable. The 
only characteristic that changes is the size of the network, 
namely the number of POI nodes would double. Note that 
this definition does not prevent one to visit a given COI more 
than once, nor having the origin and destination as the same 
points in the TDG. That is, a path such as: leaving home, 
going to a restaurant, a movie theatre, a shopping mall, an- 


other restaurant and then finally back home is a perfectly 
legitimate route. 

We also make the practically reasonable assumption that 
the (time-dependent) travel cost of each edge in the TDG 
satisfies the FIFO property, i.e., an object that starts travers¬ 
ing an edge first has to finish traversing this edge first as 
well. The general time-dependent shortest path problem in 
TDGs is NP-hard [^, but it has a polynomial time solution 
in FIFO networks. In the context of our problem, the FIFO 
property guarantees that there is no improvement in travel 
time if one “waits” at a vertex for the “best time” to traverse 
it, thus in what follows such “waiting” is not allowed. 

Finally, our proposed solution is generic in the sense that 
it can be also readily applicable to the standard (i.e., non¬ 
time dependent) OSR query as well. Any OSR query in¬ 
stance can be trivially converted to an OTDSR query in¬ 
stance where the edge costs are constant and the time spent 
at each POI is null. 

In order to exemplify the definitions presented above, con¬ 
sider the graph shown in Figure^ which is a representation 
of a simple TDG. The curves shown in Figure represent 
the travel time costs of each edge of the network. Let us con¬ 
sider an OTDSR(TDG, s, d, t, Qc) query where t = 18 and 
Qc = [Cb , Cr] . For the sake of simplicity, but without loss 
of generality, we assume that the time spent at each GOI 
depends only on the GOI itself, i.e., it is equal for all POIs 
in the same category, e.g., the time spend at any bank in Cb 
and any restaurant in Cr is 15 min and 60 min respectively. 


(s,bl) = (bl,s) 


(bl,v2) = (v2,bl) 



(vl,v2) = (v2,vl) 


(vl, d) = (d,vl) 



(v2,r2) = (r2,v2) 



time 






Figure 1 : A small TDG instance with three different 
COIs: banks Cb = {bi and ^2}, restaurants Cr = {ri 
and r2} and a gas station Cg = {di}? ^ starting point 
point s and a destination node d. Note that vi and 
V2 are not POIs, but rather just road intersections.. 

A greedy approach to solve the standard OSR problem is 
to perform independent NN searches locally. The OSR query 
cannot be optimally solved by using such an approach 
and this is also true for the OTDSR query. Indeed, if we 
were to use such a greedy approach for the query instance 
above we would need to perform a 1-NN query for time- 
dependent networks, following the approach in [^, twice. 
First we would find the travel time-wise closest node v G Cb 
from s at t = 18, which would be bi. The cost of traversing 
the edge (s,5i) at 18:00 is denoted as C(s^ 5 ^)( 18 : 00 ) and is 
equal to 6 min. Thus, the arrival time at bi would be 18:06. 
After staying at the POI for 15 min, the user would leave the 
node (i.e. bank) at 18:21. The next step would be to locate 
the closest node u G Cr from bi at that time. The node 
r 2 is found through the path ( 5 i,U 2 ,r 2 ) with cost 13 min 
(via (51,u2,r2), costing 6+7 min) and the arrival time in r 2 
would be 18:34. After spending 60 min at the restaurant, the 
user would then leave r 2 at t =19:34. Finally the destination 
d would be reached with cost equal to 19 minutes through 


s 



Figure 2 : Sample cost functions (travel time in min 
X time of day) for the network in Figure 


(r 2 , U 2 , ui, d) arriving at 19:53 . Thus, the total travel time of 
the route (s, 5i, U 2 , r 2 , U 2 , ui, d) is 38 minutes. However, the 
optimal one, i.e., the fastest route, is actually (s, ui, ^ 2 , ri, d) 
with travel cost equal to 21 minutes. Note that as those are 
the total travel time, it does not include the time spent at 
the POIs, since this time is set by the user and will be the 
same in any solution. Nonetheless it must be accounted for 
during the query processing as it will influence the departure 
time (hence travel cost) from any POI. 

The main contribution of this paper is a method to find 
optimal solutions for OTDSR queries. The method is based 
on the A* search algorithm that is equipped with a suit¬ 
able and provably admissible heuristic function, which guar¬ 
antees the optimality of the search result. For comparison 
purposes, we also present a solution that is obtained by ex¬ 
tending the also optimal PNE (Progressive Neighbor Ex¬ 
ploration) approach proposed in to cope with the time- 
dependency characteristic of our problem of interest. Our 
extensive experiments using synthetic and real networks show 
that our algorithm is more efficient than the one using the 
extended PNE approach. 

The remainder of this paper is structured as follows. In 
Section!^ we present a brief discussion of related works. In 
Section we present our proposed approach, and also how 
we extended the PNE approach. The experimental evalua¬ 
tion and results are shown in Section Einally, Section 
presents a summary of our findings and some suggestions for 
further work. 






















2. RELATED WORK 

The Trip Planning Query (TPQ) was originally proposed 
in [^. Differently from the OSR query as well as from the 
OTDSR query, no order among COIs is imposed in TPQs, 
which leads to its NP-Hardness and thus the use of approx¬ 
imation algorithms. 

Optimal solutions to the OSR query in vector and metric 
spaces were first proposed in and later extended in [^. 
For vector spaces, presented a light threshold-based iter¬ 
ative algorithm named LORD, that utilizes various thresh¬ 
olds to filter out the locations that cannot be in the opti¬ 
mal route. The authors also proposed R-LORD, an exten¬ 
sion of LORD which uses R-trees to examine the threshold 
values more efficiently. For metric spaces and arbritrary 
size of the COI sequence, they proposed the PNE approach 
that progressively applies nearest-neighbor queries on differ¬ 
ent point types to construct the optimal route for the OSR 
quer^Q The main goal of PNE is to replace expensive dis¬ 
tance computations of LORD. In the authors proposed 
another approach which is applicable for both vector and 
metric space. They exploited the geometric properties of 
the solution space and theoretically proved its relation to 
additively weighted Voronoi diagrams which are recursively 
accessed to incrementally build the OSR. That approach is 
not suitable for comparison to the one we propose because 
they compute the solution “backwards,” i.e., from the last 
POI to the first one in the desired sequence. This is not 
feasible in a time-dependent setting since one would have to 
know the arrival time at the last POI beforehand, which is 
itself a function of the departure time that is specified at 
query time. Thus, we chose to compare our approach to the 
earlier PNE proposal. 

In 1^ the authors proposed a solution that is based on 
the Incremental Euclidean Restriction (lER) [^. It first 
finds the shortest OSR given by searches in the Euclidean 
space, verifies its length in the road network and this value 
is used as an upper bound. All OSRs that have a length 
smaller than this value also have the potential to be the 
shortest route in the road network. Therefore, they must be 
searched in the Euclidean space, and then the results must 
be verified in the road network. The shortest of them is 
returned as the result. 

In two algorithms that perform an unidirectional and 
a bidirectional search, respectively, were proposed. Both of 
them are controlled by an A* algorithm and use the Eu¬ 
clidean distance between a vertex and the destination as the 
guiding heuristic function. The authors also proposed using 
a structure called “visited POI graph”, in order to reduce 
multiple node expansions. 

Two other techniques were presented in [^. The first, 
named Iterative Doubling, is an improvement of the EDJ 
solution [^, a Dijkstra-based approach. The OSR is found 
by performing I Dijkstra runs on the graph, where I is the 
number of COIs that must be found. Making use of the fact 
that sequenced route queries tend to be mostly local, this 
solution is improved by avoiding exploring facilities that are 
too far. The second solution is based on an extension of the 
contraction hierarchy , that was originally proposed as a 
pre-processing step for ordinary shortest path queries. 


^ Since we compare our proposal to a time-dependent ex¬ 
tension of the PNE approach, we discuss it in more details 
in Section 


Another query related to the one addressed in this work, 
named multi-rule partial sequenced route (MRPSR) query, 
was investigated in [^. This query generalizes the OSR 
and the TPQ queries, by translating ordering constraints 
into rules. When the set of partial sequence rules is empty 
this query is equivalent to the TPQ query and when the 
set of partial sequence rules contains one partial sequence 
rule specifying the order given by the user, this problem is 
identical to the OSR problem. 

Several other works have studied the problem of finding 
the fastes t route bet wee n two locations in time-dependent 
networks 14 15 However, these works do not deal 
with the need to visit specific COIs when traveling between 
locations. 

In the authors proposed a travel planning problem 
which consists of finding the best travel plan from an ori¬ 
gin to a destination following a predetermined sequence of 
POIs in a transportation network with deterministic time- 
dependent travel times. Other works have considered the 
problem of finding the best trip plan which visit several pre¬ 
determined POIs (no order is specified) [^. Differently 
from those works, our OTDSR query chooses one (of pos¬ 
sibly several) POI from each COI according to a specified 
COI sequence in order to minimize the route’s total travel 
time. 

In the authors proposed the traffic-aware route search 
(TARS) query which aims at finding the fastest route from 
an origin to a destination via POIs of specified types, while 
taking into account the time-dependency of the network and 
the possibility that some visited entities will not satisfy the 
user. A TARS query may include temporal and order con¬ 
straints that restrict the order by which entities are vis¬ 
ited. Since it is a NP-hard problem, as it is a generalization 
of the Traveling-Salesman Problem, three heuristics to an¬ 
swer TARS queries were presented. Differently from this 
approach, we assume that a total order over the COIs is 
given and, thus, we are able to find an optimal solution. 


3. SOLUTIONS TO THE OTDSR QUERY 

We propose two solutions to solve the OTDSR problem 
optimally. We first present a baseline solution which is based 
on the progressive neighbor exploration (PNE) approach in 
PNE is non-time-dependent thus we extended it in or¬ 
der to cope with the time dependency characteristic of the 
OTDSR query. The resulting approach, which we name TD- 
PNE, uses the TD-NE-A* algorithm to perform the nec¬ 
essary time-dependent NN local searches. Next, we propose 
our solution, called TD-OSR, which uses an A* search to 
guide the network expansion; we also propose a scheme to 
reduce the number of node re-expansions. 

3.1 Pre-processing step 

Aiming at reducing the execution time of the query, a pre¬ 
processing step is performed in both solutions, TD-PNE and 
ours, in order to calculate bounds to guide the expansion of 
vertices. 

Particularly, the TD-PNE solution uses the pre-computed 
bounds in the TD-NE-A* algorithm , which is used to find 
local time-dependent nearest neighbors. The original TD- 
NE-A* algorithm aims at finding the k closest POIS from a 
starting point s at a given departure time t by performing 
an A* search. That algorithm pre-computes lower bounds 
to reach the closest POI, belonging to any COI, from every 








vertex in the network. These bounds are used as the heuris¬ 
tic function in the A* search. As we are interested only in 
POIs belonging to the sequence of COIs given as input, we 
modified that algorithm so that, in the pre-processing step, 
we calculate optimistic estimates to reach the closest POI 
in each COI, from each v E V. Thereby, the search can be 
guided towards a specific COI. 

In order to calculate these lower bounds, independently 
of a departure time, we construct a lower bound graph G. 
Given a TDG G, G is a graph that has the same set of ver¬ 
tices V and edges E sls G but the cost of an edge G is given 
by the minimum cost possible to traverse the same edge in 
G; G is thus an optimistic non-time dependent version of G. 
For example, considering the TDG shown in Figure ^ the 
cost of the edge (s, &i) in G would be equal to 5 minutes. 

Once G is constructed, we pre-compute, from every v eV 
and for every Gi E G, the cost L{v,Gi) that is the cost of 
the fastest path from v to its nearest POI belonging to the 
category Gi in G. Note that we do not calculate the cost to 
reach every POI of the network from every v E V. Moreover, 
these costs are calculated for every COI because the COI 
order given as the query input may include any COI and 
they are not known in advance. 

In order to compute these cost estimates, we execute Di- 
jkstra’s algorithm from each vertex v to find its nearest POI 
from each category in G. In the worst case, this algorithm 
runs in time 0{\E\ log |y|) for each execution. As we start 
a search from each vertex, the total complexity in the worst 
case is 0(|y| |F^| log \ V\). It is noteworthy to emphasize that 
this is an one time cost which will be amortized over time. 
Also, the calculated costs do not tend to be updated fre¬ 
quently since they are generally given by the time to traverse 
the roads when it is possible to drive at the maximum al¬ 
lowed speed. Regarding space complexity, the space required 
to store all the information calculated in the pre-processing 
step is equal to |C| |y| bytes, where \G\ is the number of 
categories of POIs. 

3.2 PNE’s Time-dependent Extension 

In order to have a baseline solution, we propose a time- 
dependent extension of the PNE algorithm, named TD- 
PNE. The PNE algorithm generates the optimal route pro¬ 
gressively by performing local NN searches in road networks 
with static costs. 

Eor the sake of completeness, Eigurej^ illustrates how the 
original PNE algorithm works. Let us suppose that one 
departs from the vertex s and wants to visit a bank and a 
restaurant, in this order. The PNE algorithm first looks for 
the nearest bank from s, 5i, and stores this bank and the 
cost to reach it in the heap (also shown in Eigure[^. Next, 
it expands the entry with minimum cost in the heap, (bi: 4), 
and then it looks for the second nearest bank from s and for 
the nearest restaurant from bi . It finds b 2 with cost 5 and r 2 
with cost 9. The entries (^ 2 : 5) and ( 61 , r 2 : 13) are inserted 
in the heap. Note that a complete route (&i,r 2 ) was found. 
The cost of this route is then used as an upper bound, i.e, 
routes with cost greater than 13 are not investigated. The 
expansion continues in the same way and the route (s, ^ 3 , r 2 ) 
with cost 9 is returned as the optimal one. 

We modified that algorithm such that each NN search 
accounts for the dependency on departure time and is per¬ 
formed using the TD-NE-A* algorithm [^. The latter is 
equipped with the new heuristic described above, that takes 


s 



step 

heap contents 

1 

{bi 

4 ) 

2 

(62 

5 ), (61,ra: 13 ) 

3 

(63 

7 ), (62,ri: 12) 

4 

(^3, ^2: 9 ) 


Figure 3 : Entries stored in the heap and the POIs 
found during the execution of PNE algorithm. 


into account the next COI to be visited in the sequence. Eor 
instance, the time-dependent NN query that starts from s is 
guided towards its closest banks and takes into account the 
departure time from s. 

3.3 The TD-OSR Algorithm 

We now present TD-OSR, our main contribution in this 
paper. It uses an A* search to guide the network expansion 
so that vertices with more potential are examined first. To 
measure the potential of a vertex v, a function f{v) = g{v)-\- 
h{v) is used, where g{v) is the current cost from the starting 
point s to , and h{v) is a heuristic function, which in our 
case is a lower bound estimate of the time to pass through 
all the categories of POIs and to reach the destination. The 
smaller the value given by the sum of the current cost to 
a vertex plus the heuristic function value for it, the greater 
its potential. If the estimate h{v) is a lower bound to the 
actual cost —which is the case as we shall show shortly— 
f{v) is said to be admissible and therefore, by virtue of the 
A* design the solution, i.e., the path found in the TDG, is 
guaranteed to be optimal. 

In order to calculate the h{.) value, we need cost esti¬ 
mates to reach the COIs in the graph as well as to reach the 
destination d. The first is computed in the pre-processing 
step (described above) and is independent of the sequence 
given as input. The calculation of the estimate to reach the 
destination is more difficult because it can potentially be 
any node of the network. To pre-calculate the shortest path 
from every node to every other node that could be set as d is 
an impractical option. However, once d is given as a query 
input, we can compute the shortest path from every vertex 

V eV to dhy running Dijkstra’s algorithm (one-to-all short¬ 
est paths) from d in the reverse graph of G only once. A 
reverse graph of G is a graph with the same set of vertices 
as G, but the edges are reversed, i.e., if G contains an edge 
{u^v) then the reverse of G contains an edge (r’,!^). 

Once we have the estimates to reach the categories of POIs 
and to reach the destination computed for any vertex we 
can calculate the heuristic function h(y) as follows: 

Given the sequence of COIs Gq — (Cg, Cg,..., C^) and 
considering that a path from the starting point s to a vertex 

V has already passed by POIs of the categories Cg,...,CJ, 










the heuristic function value for v is given by: 

h{v, Cq[i-\-l...m],d) — max{L{v^ L(u, C^), L(u, d)} 

( 1 ) 

where L(u, CJ) is the cost of the shortest path from v to 
its nearest POI of the category in G and L{v,d) is the 
estimate to reach d from v. In other words, the cost to pass 
through the categories of POIs in Cq = (Cg, Cg, C^) and 
to reach the destination is at least the minimum cost to reach 
the farthest of them. 

Let us define two concepts and their notations (which 
will also be used in the algorithm shortly). We define as 
the Travel Time (TT) of a mobile user the accumulated 
cost of traversing all edges in its current path until reach¬ 
ing the current node u, and we denote it by TTy. Simi¬ 
larly, we denote the Arrival Time (AT) of the mobile user 
at a node v as ATy. Moreover, we denote ATq = t (the 
query’s departure time) and when traversing an edge be¬ 
tween node u and u, we have ATy — ATy + Cu^;(AT^^). 
If we denote TT{v^d^ ATy^Cq[i + l...m]) as the remaining 
travel time leaving from v at time ATy towards d through 
POIs in the sequence Cq[i + l...m], it suffices to prove that 
h(v, Cq[i + l...m], d) < TT{v, d, ATy,Cq[i + l...m]), that is: 

Lemma 3.1 /i(u, Cg[z + l...m], d), as defined in Equation^ 
is a lower hound to the aetual optimal remaining eost to 
travel from v to d. 

Proof. Let us assume that we have: 

h(v, Cq[i + l...m], d) > TT(u, d, ATy, Cq[i + 1 ...?ti]) 
or equivalently that: 

max{L{v, 0^+^),L{v, C'^),L{v, d)} > 

TT{v,d, ATv,Cq[i + 

We consider two cases: the maximum value is given by 
L(v, Cq) for z + 1 < j < m or by L(v, d). In the first case we 
have L{y,C^q) > TT{v,d,ATy,Cq[iA- l...?n]), which is not 
possible because the path from u to d has to include POIs of 
the categories Cq[i + l...m] in the specified order including 
a POI of the category Cg and thus, the cost of this path can 
not be greater than the lower bound to the cost of reaching 
a POI belonging to Cq from v. In the second case we have 
L{v,d) > TT{v, d, ATy, Cq[i -\- which is also not pos¬ 

sible because L{v,d) is a lower bound to the cost of reaching 
d from v. Therefore, the assumption must be incorrect and 
the proof is complete. □ 

Thereby, the heuristic function that guides the A* search 
in our approach is admissible and therefore the obtained 
path is guaranteed to be optimal. 

Algorithm shows the pseudo-code for the TD-OSR al¬ 
gorithm. First, the algorithm loads the reverse graph of G, 
denoted as G^. Then, Dijkstra’s algorithm is used to cal¬ 
culate the costs of the fastest paths from d to every v E V 
in G^. These costs, which are estimates to reach the desti¬ 
nation, are stored in vector destination. This initialization 
step is shown in the lines 1 and 2. 

Next, the algorithm begins the network expansion and 
inserts s in a priority queue Q (line 7) which stores the set 
of candidates for expansion in the next step. An entry in 
Q is a tuple {vi, ATvi,TTvi, LBvi, Rvi), where ATvi is the 
arrival time at vi, TTvi is the travel time from s to LBvi 
is given by TTvi + h{vi,C[i...m]), calculated according to 


Algorithm 1: The TD-OSR algorithm 
Data: A starting point s G R, the departure time 
t G [0,T], a sequence C = [C \,..., C^] and a 
destination d G R 

Result: The fastest route from s to d considering the 
departure time t and selecting one POI from 
each category in C according to the given 
sequence 

1 G_^ ^ loadReverseG; 

2 destination[] ^ Dijkstra(G^,d); 

3 TTs ^ 0; 

A ATs^t ; 

5 LBs ^ h{s,C[l...m],d); 

e Rs ^ []; 

7 Enqueue {s, ATs,TTs, LBs, Rs) in Q; 

8 while Q 7^ 0 do 


9 

10 
11 
12 

13 

14 

15 

16 

17 

18 

19 

20 
21 
22 

23 

24 

25 

26 

27 

28 

29 

30 

31 

32 

33 

34 

35 

36 

37 

38 end 


{u, ATu, TTu, LBu, Ru) ^ Dequeue Q; 
add u to Removed[\Ru\]] 
u — d and \Ru\ = \C\ then 
I Return Ry ; 

end 

for v G adjacency{u) do 
next ^ |Ru| + 1; 

Ry i Ru"') 
spent ^ 0; 
if u G C[next] then 

spent i '^C[next]-) 

Ry[next] ^ v; 
next ^ next + 1; 

end 

TTy ^ TTy + C(^y^yfiATy); 

ATy ^ {t A- TTy + spent) mod T; 

LBy ^ TTy + h{v, C[next...m],d)-, 
if V was not removed in a position > |R^| then 
if V is not in Q then 
I Enqueue (y, ATy, TTy, LBy, Ry) in Q; 
else 

length ^ |; 


lb ^ LBy, 


'inQ ’ 


if length < and lb > LBy then 
I Update {v,ATy,TTy,LBy,Ry) in Q; 

end 


end 
end 


end 


Equation Q and Rvi stores the ordered list of POIs that 
have already been reached in the path from s to Vi. The 
priority of elements in Q is given by increasing order of LBvi, 
thus vertices that offer a greater chance to reach the POIs 
in C and the destination quickly are checked first. 

The vertices are dequeued from Q (line 9) and expanded. 
When a vertex u is dequeued from Q it is marked as removed 
(line 10) in the list corresponding to the number of POIs that 
have already been reached in the path to it. Eor example, 
when s is dequeued, it is inserted in the list Removed^)], 
meaning that no POI, according to the given sequence C, 
was reached in the path to s. Eor every v adjacent to u we 
check if it is a POI and if it belongs to next category in the 










sequence. If this condition is satisfied (which is verified in 
line 18), v is inserted in the next position of Rv and the 
number of POIs that belong to the sequence found in the 
path to V is incremented. Furthermore, we add the time the 
user expects to spend at a POI of the next category, Tc[next ], 
to the arrival time at the neighbors of v (line 19). 

For each u, we check whether it is in Removed[\Rv\] ... 
Removed[m] (line 26). If it is in any of these lists, it is not 
beneficial for us inserting u in Q, since it has been already 
found in a path that includes more POIs and has a lower 
LBy. If it is not in any of these queues and neither in Q 
(which is verified in line 27) it is inserted in Q. If it is in 
Q, we check if the new path to it includes more POIs in 
sequence than in the old one in Q and if LB^ (of the new 
entry) is less or equal to the one in Q (line 32). If these 
conditions are satisfied, the old entry of v is removed from 
Q and the new one is inserted. By doing this, we avoid 
re-expanding vertices unnecessarily. 

The algorithm stops when the next vertex expanded is the 
destination d and the path from s to d, Rd^ includes all the 
categories of POIs in C according to the visiting sequence 
as shown in lines 11 and 12. 

3.3.1 Complexity Analysis 

Before analyzing the complexity of our proposed solution, 
we need to prove the following property regarding the num¬ 
ber of times that a vertex is expanded. 

Lemma 3.2 A vertex is expanded at most \Cq \ +1 times in 
the TD-OSR algorithm. 

Proof. Let us assume a vertex v is expanded more than 
Cg + 1 times, i.e., v was inserted in Q at least Cg + 2 times. 
As a path from s to u may include from 0 to |Cg| POIs that 
follows the given sequence, v has been found through at least 
two different paths that include the same number of k POIs 
and the corresponding entries were inserted in Q. However, 
this cannot happen since v is only inserted in Q if it has not 
already been found in another path that includes at least 
k POIs, therefore the assumption is false and the lemma is 
proven. □ 

Every time the main loop executes, one vertex is extracted 
from the queue. Since there are V vertices in the graph and 
each vertex in inserted in Q at most Cg +1 times, the queue 
may contain 0(|Cg| |y|) vertices. Each removal operation 
takes 0(log |Cg| |y|) time. Thus, the total time required to 
execute the main loop is 0(|Cg| |y| log lOgllEl). Since each 
vertex is expanded at most Cg + 1 times, at most 0(|Cg| \E\) 
edges are traversed during the expansion of vertices. Eor 
each edge, its ending vertex can be inserted or updated 
in Q, which takes 0(log|Cg| \V\) time. Therefore, the to¬ 
tal run time is 0(|Cg| |y| log |Cg||y| + |Cg| \E\ log |Cg||y|) = 

Oi\C,\\E\log\C,\\V\). 

4. EXPERIMENTS 

We performed experiments using both synthetic datasets 
and a real dataset from the Attica region in Greece. We 
compared TD-OSR, our proposed solution, to the TD-PNE 
solution according to the (logarithmic) number of expanded 
vertices in the search and the (logarithmic) processing time 
of the query. 

4.1 Synthetic network 


We generated a synthetic time-dependent road network as 
a regular grid where each grid point corresponds to a vertex 
in the TDG. The POIs were uniformly distributed over the 
network and each category has, in average, the same number 
of POIs. To generate the edges cost, for each edge, we chose 
a random speed between 30 km/h and 80 km/h for each 60 
minutes interval of the day, our chosen time granularity, so 
that the EIEO property is observed within the network. 


Network Size (x 1,000) 

25, 50, 100 

Density of POIs 

0.5%, 1%, 2% 

Vertex degree 

2, 2.5, 3 

Number of COIs 

5, 10, 20 

Sequence Size 

1, 3, 10 

Query Locality 

5%, 15%, 50% 


Table 1: Experimental parameters and their values 
(bold defines default values). 

We evaluated how each of the solution perform with re¬ 
spect to the parameters shown in Table Regarding the 
network, we varied its size (i.e., number of nodes), the den¬ 
sity of POIs (i.e., the number of POIs over the number of 
vertices), the vertices degree and the number of COIs in the 
network. Eurthermore, we investigated how the algorithms 
behave in relation to the size of the COI sequence given as 
input and the distance between the query and the destina¬ 
tion vertices, which we refer to as “Query Locality”. This 
distance is a percentage of the network diameter and serves 
to show how harder the problem becomes as this distance 
increases. Eor each experiment, we varied a parameter and 
set the others parameters to their default values. Eor each 
combination of different parameters, we generated one TDG 
and executed 10 randomly generated queries. 

Effect of the network size. Eigure shows how the 



network size 



network size 

Figure 4: Processing time of the queries and num¬ 
ber of expanded vertices when the network size in¬ 
creases. 
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solutions behave according to the network size. Both the 
number of vertices expanded and the processing time in¬ 
crease with the network size in all solutions. This can be 
explained by the existence of a greater number of paths 
that can lead to promising POIs in a larger network. It 
is important to notice that the TD-PNE solution is more 
affected by this variable than the TD-OSR solution. This 
happens because the TD-NN searches performed within TD- 
PNE become more costly with an increase in the number of 
vertices. The TD-PNE solution also investigates all the pos¬ 
sible routes which have a cost within the cost of the shortest 
route found so far, not accounting for (an estimate of) the 
cost of reaching the destination, unlike the TD-OSR algo¬ 
rithm. This ultimately leads to investigating routes that are 
not very promising. 

Effect of the density of POIs. The processing time 
of the queries and the number of expanded vertices decrease 
with the density of POIs for the TD-OSR solution whereas it 
increases slightly for the TD-PNE solution, as shown in Eig- 
ure When the number of POIs in the network increases, 
it is easier to find a POI, and thus a route, simply because 
there are more choices of POIs. This explains the behaviour 
of the TD-OSR solution. On the other hand, in the TD- 
PNE solution, even if the TD-NN searches are faster when 
the POIs become denser, the number of candidate routes 
to be investigated is greater. With an increasing number of 
POIs, there is a greater chance of finding partial routes that 
have a small cost and these routes will be investigated as 
they offer (misleadingly) a chance to be the optimal route. 
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Figure 5: Processing time of the queries and num¬ 
ber of expanded vertices when the POI density in¬ 
creases. 


Effect of the degree of the vertices. As expected and 
shown in Figure the processing time of the queries and 
the number of expanded vertices increase with the degree 
of the vertices in all solutions. This is reasonable since the 
number of paths in a network where the vertices have a 
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Figure 6: Processing time of the queries and number 
of expanded vertices when the degree of the vertices 
increases. 


higher degree is greater, making it more difficult to find POIs 
and consequently a route. 

Effect of the number of COIs in the network. Fig¬ 
ure shows that the processing time of the queries and 
the number of expanded vertices increase with the num¬ 
ber of COIs in the network in the TD-OSR solution while 
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Figure 7: Processing time of the queries and number 
of expanded vertices when the number of COIs in 
the network increases. 
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it decreases in the TD-PNE solution. Since the POIs are 
uniformly distributed among the categories, increasing the 
number of categories, decreases the number of POIs per 
COL Thus, it is harder to find a POI that belongs to a 
certain category. This explains the behavior of the TD-OSR 
solution. On the other hand, the TD-PNE solution takes 
advantage of this because it limits the number of candidate 
routes to be investigated. 

Effect of the sequence size. As expected and shown 
in Eigurej^ the processing time of the queries and the num¬ 
ber of expanded vertices increase with the size of the COI 
sequence in both solutions. Generally speaking, this is be¬ 
cause more vertices have to be checked to find a sequence 
with more POIs. Although this variable does not influence 
the cost of a single TD-NN search, more of those searches 
need to be performed in the TD-PNE solution when the se¬ 
quence size increases. Eurthermore, in the TD-PNE solution 
there will be more candidates routes to be investigated. We 
can also notice that this variable significantly affects the TD- 
OSR solution. The greater the number of COIs, the higher 
the effort of this solution since there are more possible com¬ 
binations of POIs to form a route that follows the sequence 
and more of those candidate routes will be investigated. 
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Figure 8: Processing time of the queries and number 
of expanded vertices when the sequence increases. 

Effect of the locality of the query. Figure shows 
that the cost of all solutions increase with the distance be¬ 
tween the origin and the destination. The larger the dis¬ 
tance between the origin and the destination, the greater 
the number of candidate shortest paths between those two 
locations. Particularly, more routes will be investigated in 
the TD-PNE solution because the upper bound found in 
this solution tends to be high and many partial routes with 
cost within this upper bound may be found. This involves a 
larger number of TD-NN searches, which explains why this 
solution is the most affected by this variable. 


Figure 9: Processing time of the queries and number 
of expanded vertices when the distance between the 
origin and the destination increases. 


The road network of Attica was extracted from OS1V0 It 
has 161066 vertices, 204854 edges and a diameter of about 70 
km. We chose 20 COIs, that represent services, from the ex¬ 
isting ones in the real dataset, making a total of 1348 POIs, 
i.e,. an average of 67 POIs per COI. We divided those POIs 
in “rare”, “common” and “frequent” POIs, each such group 
having 6-7 POIs as a sample of the actual POIs according 
to their frequency. The most and less frequent POI (gas 
stations and night clubs, respectively) made for 13% and 
represented less than 1% of all POIs, respectively. An ex¬ 
ample of a “common” POI is convenience stores, representing 
5% of all used POIs. 

To generate the edges costs for the different times of the 
day, we applied the following procedure. Each edge of the 
network belongs to a class based on maximum allowable 
speed. We created one normal distribution for each edge 
class that returns the speed with which an object is moving 
on that edge. As the mean for each distribution we used the 
maximum speed of the class plus the minimum of all max¬ 
imum speeds of all edge classes divided by 2. The chosen 
standard deviation was 1/4 of the mean. For each edge, we 
got a speed value from the corresponding distribution and 
we checked if it was less than the maximum allowable speed 
for that edge class. If so, the corresponding travel time is 
found using the edge length. This time was considered as 
a base for travels between 12am and 7am and for the time 
ranges of 8am-9am, 10am-3pm, 4pm-6pm, 7pm-10pm and 
llpm-12am) this cost was multiplied by 1.7, 1.4, 1.9, 1.3 
and 1.1, respectively, to reflect periods more or less prone to 
traffic. 

Given that this is a real dataset not all parameters can be 
changed as we did before. In the following we discuss how 


4.2 Real network 


^http://www.openstreetmap.org 






















the algorithms behave in relation to the query locality and 
the size of the COI sequence. Moreover, we also investigate 
the effect of the frequency of COIs. 
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Figure 10: Processing time of the queries and num¬ 
ber of expanded vertices for different sized COIs. 
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Figure 11: Processing time of the queries and num¬ 
ber of expanded vertices on the real network when 
the sequence size given as input increases. 


Effect of POI frequency. We generated queries with 
COI sequences of size 3, but varying the frequencies of those 
COIs. We investigated the extreme cases of sequences con¬ 
taining only frequent COIs (denoted as “FFF”) and only rare 
COIs (denoted as “RRR”) as well an intermediary case with 
randomly chosen COIs (denoted as “Random”). Figure 10 


shows the results of this experiment. Both variables increase 
in the TD-OSR solution while they decrease in the TD-PNE 
solution as the POIs query sequence become rare. It takes 
more effort to find rarer POIs and this explains the behavior 
of the TD-OSR solution. Nevertheless, in the TD-PNE so¬ 
lution, there are less candidate routes to be investigated as 
the POIs become rarer, thus the improvement in its perfor¬ 
mance. Nonetheless TD-PNE is overall much less efficient 
than TD-OSR. 

Effect of the sequence size. Eigure im shows that the 
solutions exhibit the same behavior as in the synthetic ex¬ 
periments performed in relation to this variable. There is 
a greater number of candidate routes to be investigated in 
both solutions when the sequence size increases, which ex¬ 
plains the decrease in performance as the queried sequence 
gets larger. 

Effect of the query locality. E igure p^ shows that, as in 
the synthetic experiments, the larger the distance between 
the origin and the destination, the greater the number of 
candidate shortest paths between these two locations which 
increases the number of candidate routes to be investigated 
in both solutions. 



Figure 12: Processing time of the queries and num¬ 
ber of expanded vertices on the real network when 
the query locality increases. 


5. CONCLUSION 

We have addresed an important extension to the known 
Optimal Sequence Routing (OSR) query, by considering the 
underlying network to be time-dependent, i.e., the cost (travel 
time) to traverse an edge depends on the time it is initiated. 
This extension, which we call Optimal Time-dependent Se¬ 
quence Routing (OTDSR) query is a more realistic assump¬ 
tion with respect to, for instance, urban road networks. This 
formulation of the problem allows one to easily map a tra¬ 
ditional OSR query into an OTDSR query, thus making our 
proposal generic in the sense that it can also address OSR 



































queries. Finally, our solution is based on the A* paradigm 
with an admissible heuristic function that guarantees opti¬ 
mality of result. 

In order to evaluate the merits of our proposal, the TD- 
OSR algorithm, we compared it to a time-aware extension 
of an approach previously designed for the (non-time depen¬ 
dent) OSR, which we named TD-PNE. Varying a number of 
parameters, and using both synthetic and real datasets the 
TD-OSR was always signihcantly faster than the TD-PNE. 

As future work we would like to investigate a probabilis¬ 
tic version of the OTDSR problem, i.e., one where the edge 
costs are not necessarily time-dependent but follow a prob¬ 
abilistic distribution. 
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